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[57] ABSTRACT 

A data processor (3) implements a single system stack 
pointer (244) to create records of both supervisor and user 
stack operations when hardware support for misalignment of 
such stack operations is optional The single system stack 
pointer is implemented as a self-aligning stack pointer which 
automatically aligns itself to a nearest 0-modulo-4 address 
below a current address setting such that no alignment 
system errors occur even when there is no hardware support 
for misaligned operands. Once the automatic alignment has 
occurred, the data processor stores a format field in an 
exception stack frame to indicate information about the 
alignment of the stack pointer at the time of the error. When 
the exception has been serviced, the processor uses the four 
bit format field stored in the exception stack frame to restore 
the stack pointer to its original value at the time of the 
exception. 

12 Claims, 6 Drawing Sheets 
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DATA PROCESSING SYSTEM AND METHOD during the processing of any other exception will produce a 

FOR STORING AND RESTORING A STACK total system failure and simply cause the processor to halt 

POINTER Often included in this special set of catastrophic fault 

conditions is the alignment of the system stack pointer on 

CROSS REFERENCE TO A RELATED 5 certain boundaries. In order to compensate for this 

APPLICATION possibility, prior art systems have implemented several 

different solutions which require a significant amount of 

This application is related to our commonly assigned hardware. For example, most prior art solutions require a 

patent entitled: hardware circuit which ensures that no operands will be 

"Method and Apparatus for Unstacking Registers in a 10 misaligned. These solutions have typically implemented 

Data Processing System." U.S. Pat. No. 5,640.548. assigned multiple stack pointers based on the operating mode of the 

to Motorola. Inc.. and having inventors John A. Langan et al. processor. This operating mode typically defines the level of 

1. Field of the Invention priWlege associated with the executing program, and in its 

™ # . . . . 4 ^ simplest form usually consists of two levels of privilege: 

The present invention relates in general to a data process- supervisor (ffi0St privileged) ^ uscr (lcast p^g^ 

^system, and more particularly to a stack pointer in a data 15 > supcr ^ sor ^ provides complete accei to 

processing sys m, all the resources of the system, while user mode execution 

2. Background of the Invention i s denied access to certain hardware resources. In addition. 
In data processing systems, stack pointers are hardware many processors provide dedicated hardware circuitry to 

registers typically used as a memory address pointer for enforce a required alignment of the stack pointer in the data 

defining the memory locations for program variables and processing system. Note, in each of the prior art solutions, 

temporary storage. Traditionally, one or more registers the dedicated hardware circuitry and the corresponding 

within the programming model is used by the hardware as overhead costs are required. 

the system stack pointer. In many systems, the stack pointer For data processing systems which are unable to imple- 

is implicitly used by the instruction set during the execution ment such expensive hardware circuitry, software solutions 

of certain operations. Generally, this address register defines have been developed. When using such a software solution, 

the top of a current stack frame where operands can be concurrent exceptions are not allowed. Stated another way, 

passed between functions by "pushing" the data onto the once an exception has been taken by the data processing 

memory stack. This push operation can consist of decre- system, other exceptions are not allowed. Otherwise, if 

menting the stack pointer and then copying the data to the another exception may not be masked and is asserted, then 

location defined by the new stack pointer address. After the the status and program counter information of the machine 

data has been used by a routine, the data is deallocated by context will be overwritten. Thus, the data processing sys- 

simply incrementing the stack pointer to return it to a tern will be unable to return to the state before the first 

previous value. This process is known as "popping" the exception was processed. Thus, in this second type of prior 
stack - 35 art system, the data processing system is unable to effec- 

Since these stack pointers generally define the memory tively process all exceptions correctly, 
addresses, user programs routinely manipulate the contents As can be seen, present prior art solutions require signifi- 
ed the stack pointer which defines the top of the stack. cant additional hardware circuitry to correctly restore a data 
Operands of various sizes, such as bytes, words, longwords. processing system to its initial state after an exception is 
etc., may be pushed onto me stack so that the top of the stack ^ asserted and processed. Therefore, a need exists for a 
may point to any allowable alignment at any time. Ftoces- solution which allows a designer of the data processing 
sors which provide automatic hardware support of any system to optionally include a hardware support for operand 
operand size on any operand address alignment are said to misalignment while still ensuring that the data processing 
support misalignment An operand is said to be misaligned system will correctly process all exceptions, regardless of 
if its memory address does not correspond to its natural 4J system stack pointer alignment, and recover the initial state 
boundary. The natural boundary is simply defined as an after exception handling. 

address 0-modulo-<operand size). For example, a sixteen bit rrtrf nPSPRTimmsj nu tttr hd Awtwrc 

word operand is aligned if the memory byte address is DESCRIPTION OF THE DRAWINGS 

0-modulo-2 and a thirty-two bit longword is aligned if the 1 illustrates, in block diagram form, a data process- 
memory bye address is 0-modulo-4. Alignments which fail ^ * n S system in accordance with one embodiment of the 

this definition result in misaligned operands. present invention; 

Most processors include hardware support for various HG - 2 illustrates, to block diagram form, a portion of the 

types of exception processing. These exceptions, also known central processing unit of the data processing system illus- 

as faults, typically include interrupts, system calls, and mtcd in FIG. 1; 

arithmetic error conditions. Other exceptions may also 55 RG - 3 illustrates, in block diagram form, an operand fetch 

occur. Generally, when an exception is detected, the proces- circuit of the central processor unit illustrated in FIG. 2; 

sor will create a special exception frame on the top of the FIG. 4 illustrates, in block diagram form, a register file 

system stack detailing the context of the machine at the time circuit of the operand fetch circuit illustrated in FIG. 3; 

of the fault. After this exception stack frame is created, the FIG. 5 illustrates, in block diagram form, an address 
processor passes control to a special exception handling $o generation/execution circuit of the central processor unit 

routine by transferring control to the instruction address illustrated in FIG. 2; 

specified by the exception type. Generally, the exception FIG. 6 illustrates, in block diagram form, a portion of an 

handling routine will process the fault, perhaps initiating other execution engine of the address generation/execution 

some type of corrective action, and then return to the circuit illustrated in FIG. 4; 

application executing at the time of the exception. 65 FIG. 7 illustrates, in block diagram form, a structure of an 

Most processors employ some type of special error detec- exception stack frame for a plurality of stack pointer (SP) 

tion so that certain catastrophic fault conditions occurring register values; and 
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FIG. 8 illustrates, in block diagram form, a structure of an 
exception stack frame which includes a four bit format field. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

The present invention provides a data processor and 
method of operating a data processing system in which a 
single system stack pointer may be used to create records of 
both system and user stack operations when hardware sup- 
port for alignment of such stack operands is optional. The 
single system stack pointer is implemented as a self -aligning 
stack pointer. Stated simply, this self-aligning stack pointer 
automatically aligns itself to a nearest 0-modulo-4 address 
below the current address setting such that no alignment 
system errors occur even when there is no hardware support 
for misaligned operands. Once the automatic alignment has 
occurred, the data processor of the present invention stores 
a four bit format field in an exception stack frame to indicate 
information about the alignment of the stack pointer at the 
time of the fault. When the exception has been serviced and 
a return from exception (KTE) instruction is executed, the 
processor uses the four bit format field stored in the excep- 
tion stack frame to restore the stack pointer to its original 
value at the time of the exception. 

The present invention provides a data processor and 
method of operation in which a single stack pointer is used 
to support both the supervisor and user stack areas of a data 
processing system. Furthermore, in the present invention, 
hardware dedicated to operand misalignment may be 
implemented, but is not required. The ability of this single 
stack pointer to align itself during exception processing 
allows the data processor to ensure that the data processing 
system will not experience a catastrophic error if an excep- 
tion occurs while the stack pointer is misaligned. Operation 
of the present invention will subsequently be discussed in 
greater detail. 

In a following discussion, the connectivity and operation 
of the data processing system which implements the present 
invention will be provided in greater detail. 
Connectivity of the Present Invention 

In the following description of the connectivity of the 
present invention, the term "bus" will be used to refer to a 
plurality of signals or conductors which may be used to 
transfer one or more various types of information, such as 
data, addresses, control, or status. The terms "assert" and 
"negate" will be used when referring to the rendering of a 
signal, status bit or similar apparatus into its logically true 
or logically false state, respectively. If the logically true state 
is a logic level one. the logically false state will be a logic 
level zero. And, if the logically true state is a logic level a 
zero, the logically false state will be a logic level one. 

Furthermore, the symbol preceding a number indi- 
cates that the number is represented in its hexadecimal or 
base sixteen form. The symbol preceding a number 
indicates that the number is represented in its binary or base 
two form. 

Referring now to FIG. 1, FIG. 1 illustrates a data pro- 
cessing system 6 in accordance with one embodiment of the 
present invention. Data processing system 6 comprises a 
data processor 3 and an external development system 7. Data 
processor 3 includes a system bus controller 8, a processor 
core 9, a slave module 12, a slave module 13, a master 
module 14, and a debug module 10. System bus controller 
8 is coupled to an external device (not illustrated herein) via 
an E-Bus. System bus controller 8 is coupled to slave 
module 12 and slave module 13 via an S-Bus. The S-Bus is 
a slave module bus. System bus controller 8 is coupled to 
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processor core 9 and master module 14 via an M-Bus 23. 
M-Bus 23 is a master module bus. 

Processor core 9 comprises a central processing unit 
(CPU) 2, a memory management unit (MMU)/controller 4. 

5 and a memory 6. An operand misalignment unit 5 may also 
be selectively included in processor core 9. Central process- 
ing unit 2. MMU/controller 4, memory 6. and debug module 
10 are each coupled to one another via a K-Bus 25. Both 
central processing unit 2 and memory 6 are also directly 

10 connected to MMU/controller 4. MMU/controller 4 pro- 
vides information to a remaining portion of data processing 
system 6 via a M-Bus 23. Central processing unit 2 is 
coupled to debug module 10 to provide an Internal Control 
Bus. CPU 2 also receives a Reset signal from an external 

15 device (not illustrated herein). Debug module 10 bidirec- 
tionally communicates to an external development system 7 
using the external control bus. External development system 
7 is an optional device external to data processing system 5. 
FIG. 2 illustrates a portion of CPU 2 in greater detail 

20 CPU 2 includes a register 208, an instruction fetch pipeline 
210, a buffer 216. an operand execution pipeline 220. and a 
multiplexer (MUX) 222. Instruction fetch pipeline 210 
comprises an instruction address generation circuit 202. an 
instruction fetch circuit 204, a FIFO instruction buffer 206. 

25 and a multiplexer 219. Additionally, operand execution 
pipeline 220 comprises an operand fetch circuit 212, an 
address generation/execute circuit 214. and a pipeline con- 
trol logic 218. 

Instruction address generation circuit 202 of instruction 

30 fetch pipeline 210 provides the Instruction Address signal to 
multiplexer 222. Multiplexer 222 selects between instruc- 
tion addresses from instruction fetch pipeline 210 and oper- 
and addresses from operand execution pipeline 220 for 
subsequent transactions on K-Bus 25. Register 208 outputs 

35 a KADDR (K-Bus Address) signal. Instruction fetch circuit 
204 of instruction fetch pipeline 210 is coupled to operand 
fetch circuit 212 and buffer 216 to receive a buffered 
KRDATA signal. Instruction fetch circuit 204 is coupled to 
operand fetch circuit 212 to provide an Instruction Fetch 

40 Address. Instruction fetch circuit 204 is also coupled to 
FIFO instruction buffer 206 and multiplexer 219 to provide 
an Instruction Buffer Write Data signal, FIFO instruction 
buffer 206 is coupled to multiplexer 219 to provide an 
Instruction Buffer Read Data signal. Multiplexer 219 is 

45 coupled to operand fetch circuit 212 of operand execution 
pipeline 220 to provide an Instruction signal. 

Pipeline control logic 218 of operand execution pipeline 
220 is coupled to op erand fetch circuit 212 to provide an 
Exception Processing signal. Pipeline control logic 218 is 

50 also coupled to address generation/execute circuit 214 to 
provide a control bus. Pipeline control logic 218 is coupled 
to both operand fetch circuit 212 and address generation/ 
execute circuit 214 to provide a Vector signal Address 
generation/execute circuit 214 of operand execution pipeline 

55 220 is coupled to multiplexer 222 to provide an Operand 
Address signal. As well, address generation/execute circuit 
214 is coupled to buffer 216 to provide an Operand Write 
Data signal. Buffer 216 provides a KRDATA (K-Bus Read 
Data) signal and a KWDATA (K-Bus Write Data) signal. 

60 Each of the KRDATA. KWDATA. and KADDR signals are 
provided to K-Bus 25. 

FIG. 3 illustrates operand fetch circuit 212 in greater 
detail. Operand fetch circuit 212 comprises a register 224, a 
multiplexer 1 (MUX) 226. a register file 228. a program 

65 counter register 229, a multiplexer 2 (MUX) 230. a multi- 
plexer 231. a multiplexer 3 (MUX) 232. an adder 233. a 
register A 34, and a register B 236. 
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An Instruction signal is provided to register 224 from operand fetch circuit 212 is coupled to multiplexer 5 252. 

multiplexer 219 of FIG. 2. Register 224 of FIG. 3 is coupled other execute engines circuit 256. and multiplexer 6 258. 

to a first input of multiplexer 1 226 and the Vector signal is Multiplexer 4 250 and multiplexer 5 252 are each coupled to 

coupled to a second input of multiplexer 1 226. Multiplexer ALU 254. ALU 254 is coupled to multiplexer 6 258 to 

1 226 provides constants and immediate operands, as sped- 5 provide the Execute Result signal. Multiplexer 6 258 pro- 
fied by an instruction stored in register 224. to multiplexer ***** an Operand Address signaL An output of ALU 254 is 

2 230 and multiplexer 3 232. An Execute Result signal is Execute Result signal and is coupled to an input of 

coupled to register file 228 and to each of multiplexer 2 230 multiplexer 6 258. Multiplexer 6 258 men provides an 

and multiplexer 3 232. Register file 228 is coupled to Operand Address signal. Logic circuit 253 is coupled to 

multiplexer 2 230 to provide an "A M signal and is coupled to 10 K ^ ^ . and , S !?V S .^ster 257 provides an SR 

multiplexer 3 232 to provide a "B" signal. Addiuonally. R f K giStcr) agnaL ^ 15 ^ Programmable register 

register file 228 provides an SP(1:0) signal. The SP(1:0) t^^'? 10 "?™ ^Y^^ ™* ^ 

• a u% _ a r c? > n • . I control functions. It may also be known as the processor 

agnal provides bite zero and one of a Stack Pointer value status word in other architectures. Additionally, the SP(1:0) 

^ , ^ Muit ?»? 2 230 * C ° Upled 10 *e PC signaL and the SR (Status RegisW) signal are 
» and multiplexer 3 232 is coupled to register is provided to other execute engines circuit 256. Other execute 

B 236. Register A 234 provides a RA signal. Register B 236 engines circuit 256 provides an Operand Write Data signal 

provides a RB signal The RA and RB signals are provided and an Execute Result signal. 

as registered input operands to address generation/execute FIG. 6 illustrates the other execute engines circuit 256 in 

circuit 214. greater detail. Other execute engines circuit 256 comprises 

The Instruction Fetch Address signal is provided to a first 20 a logic 259, a multiplexer 260, a register 261, and a register 

input of multiplexer 231. An output of multiplexer 231 is 262. The SP( 1:0) signal is provided to an input of logic 259. 

provided to program counter register 229. Program counter An output of logic 259 is coupled to multiplexer 260 to 

register provides a PC (Program Counter) signal as an output provide a Format signal. The SR, PC, Format, Vector, and 

and to an input of adder 233. Adder 233 is used to calculate SP(1:0) signals are provided to multiplexer 260. Multiplexer 

a next sequential program counter value with its output 25 260 is coupled to register 262. Register 262 provides the 

connected to a second input of multiplexer 231. As CPU 2 Operand Write Data signal. Additionally, the RB signal is 

executes instructions, the contents of program counter reg- provided to register 261. Register 261 provides a Retrieved 

ister 229 represent the instruction address for the instruction Format signal and also provides the Execute Result signal, 

stored in register 224. Description of Operation 

FIG. 4 illustrates register file 228 in greater detail. Reg- 30 During operation of data processing system 5, processor 
ister file 228 generally comprises all the progranimable core 9 uses K-Bus 25 to connect CPU 2, MMU/Controller 
general-purpose machine registers implemented in CPU. 4, and memory 6. Additionally, in the present invention, a 
Register file 228 is typically comprised of a multiplicity of user may choose to include operand misalignment unit 5. If 
registers, such as a register 240, a register 242, a stack operand misalignment unit 5 is included in data processor 3. 
pointer 244. a multiplexer 246, a multiplexer 248. and AND 35 K-Bus 25 connects it with each of CPU 2, MMU/Controller 
gate 272, and an AND gate 274. The Execute Result signal 4, and memory 6. In the present embodiment of the 
is provided to each of register 240, register 242, and stack invention. K-Bus 25 is a high speed, single-cycle access bus. 
pointer 244. The contents of each of register 240, register Memory 6 may include random access memory (RAM). 
242. and stack pointer 244, as well as the contents of other read only memory (ROM), cache blocks and any combina- 
registers not specifically illustrated herein, but included in 40 tion thereof. All other system modules and peripherals are 
register file 228. are provided to each of multiplexers 246 coupled to processor core 9 via M-Bus 23. M-Bus 23 is an 
and 248. Multiplexer 246 provides an "A" signal and internal multi-master bus which is used to perform data 
multiplexer 248 provides a "B" signal. transfers initiated by one of multiple bus masters. System 
Stack pointer 244 is a programmable register defined as a bus controller 8 provides multiple functions. System bus 
system stack pointer which provides a low order two bits of 45 controller 8 provides an interface between internal M-Bus 
the contents of the stack pointer register as SP(1:0). These 23 and the external E-Bus. if the external E-Bus exists, 
low order bits define the byte address within a thirty-two bit Additionally, system bus controller 8 serves as a focal point 
wide address space. The SP(1) signal provides bit one of a for controlling all data transfers on the S-Bus. The S-Bus is 
Stack Pointer value stored in stack pointer 244 of register file used to connect simple slave peripheral modules (12 and 13) 
228. Similarly, the SP(0) signal provides bit zero of a stack 50 such as timers, serial communication channels, and memo- 
pointer value stored in stack pointer 244 of register file 228. ries into data processing system 5. 
The SP ( 1 ) signal is provided to a first input of AND gate 272 In the present invention, data processor 3 has several 
and mc Exception Processing signal is provided to a second layers of bus bandwidth to provide a cost-effective mecha- 
input of AND gate 272. An output of AND gate 272 is nism for data transfers. Processor core 9 is interconnected 
coupled to an input of multiplexer 248. Additionally, the 55 with a high-speed, single cycle K-Bus 25 to maximize 
SP( 0) signal is provided to a first input of AND gate 274 and performance. For transfers not directly coupled to this 
* e Exception Processing signal is provided to a second high-speed bus. M-Bus 23 provides bandwidth from any of 
input of AND gate 274. the internal bus masters such as processor core 9 and master 
FIG. 5 illustrates address generation/execute circuit 214 module 14. System bus controller 8 provides a connection 
in greater detail. Address generation/execute circuit 214 60 between the internal M-Bus 23 and the external E-Bus (if 
comprises a multiplexer 4 250, a multiplexer 5 252, a logic present), while also providing a control function for all data 
circuit 253, an arithmetic logic unit (ALU) 254, an other transfers on the low-cost, lower bandwidth S-Bus for slave 
execute engines circuit 256, a status register 257, and a peripheral modules such as slave modules 12 and 13. Debug 
multiplexer 6 258. module 10 is connected to K-Bus 25 to allow non-intrusive 
The RA signal provided by operand fetch circuit 212 is 65 observation of all processor-initiated memory accesses, 
coupled to multiplexer 4 250, other execute engines circuit Debug module also provides the connections to the optional 
256, and multiplexer 6 258. The RB signal provided by external development system 7. External development sys- 
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tern 7 is typically connected during program debug and From multiplexer 219 of instruction fetch pipeline 210, 

development during system integration testing. Data proces- the Instruction signal is provided to both pipeline control 

sor 3 is connected to external development system 7 to logic 218 and operand fetch circuit 212 of operand execution 

communicate information. External development system 7 pipeline 220. Additionally, instruction fetch circuit 204 of 

communicates with data processor 3 via the External Con- 5 instruction fetch pipeline 210 provides the Instruction Fetch 

^ kus Address signal to operand fetch circuit 212. This Instruction 

During a following discussion about operation of data Fctch Addr « s * used «° estabUs , h a . ncw 

processing system 5. Lume that a user ofk processor 3 ?°'* t « feg,Ste | , 229 , l^^JZ 

*~ . . , , , . „„™:„.Ji uMfK tfi* instructions, such as taken branches. For simple instructions 

has aetexrmned that ^J*™*™**™*™** ™* * c in involving register operands, the operand execution pipeline 

device should be reduced and has opted to unplement data 10 acts a$ \ ^tagfpipeUne. In*e first stage, pipeline 

processor 3 without operand misalignment unit 5. If it were confrol logic 21g ^codes me instruction and provides 

implemented in data processor 3, operand misalignment umt contro i information to operand fetch circuit 212. 

5 would provide logic circuitry between CPU 2 and K-Bus Concurrently, operand fetch circuit 212 accesses the 

25 that would intercept misaligned references output by required values stored in register file 228. In the second 

CPU 2. Operand misalignment unit 5 would then reconfig- is statc% pipeline control logic 218 provides control signals to 

ure the misaligned references to a series of aligned refer- address generation/execute circuit 214 to perform the 

ences. However, assume in the present embodiment of the required data operation on the registered input operands. RA 

invention, the user does not implement operand misalign- 234 and RA 236 using ALU 254 or the other execute engines 

ment unit 5 in data processor 3. 256. The resulting data is provided on the execute Result 

As previously discussed, by failing to implement operand 20 signal and typically written into register file 228 at the 

misalignment unit 5, a prior art data processor would gen- conclusion of the machine cycle. 

crate an exception upon any attempted reference to a mis- In the present embodiment of the invention, pi peline 

aligned operand If the misaligned reference occurred as part control logic 218 will assert the Exception Processing signal 

of the exception processing for another type of fault, then the to indicate that an exception has occurred. As an example, an 

catastrophic double fault condition occurs. For most data 25 exception may be generated in response to an external 

processing systems, such double exceptions are treated as interrupt request Generation of an exception in a data 

catastrophic failures and often result in the data processor processing environment is well known in the art and will not 

being halted until the system is reset. As previously be discussed in further detail herein. When the exception 

mentioned the present invention recognizes this problem occurs, data processor 3 no longer executes normal proces s- 

and has provided a stack pointer which is self-aligning and 30 ing. Rather, data processor 3 must access and execute a 

therefore, ensures that no misaligned references will be special exception processing routine. During the transition 

generated by CPU 2 in response to an exception. Operation from normal processing to exception processing. CPU 2 of 

of processor core 9 and CPU 2 of the present invention will data processor 3 creates an exception stack frame which 

now be described in greater detail. saves the information detailing the current processor context 

While data processor 3 is operating, processor core 9 35 in system memory. The actual memory locations may be 

controls execution of the program and responds as required memory 6, slave module 12. slave module 13 or a device 

to external interrupts. CPU 2 is provided to quickly and connected to the E-Bus. 

efficiently process instructions with the use of instruction Included in the current processor information stored in the 

fetch pipeline 210 and operand execution pipeline 220. In exception stack frame is a current address of an instruction 

the present embodiment of the invention* instruction fetch 40 being executed at the time the exception occurred. This 

pipeline 210 and operand execution pipeline 220 are inde- exception stack frame is placed on the 4 'top" of the current 

pendent and decoupled system stack residing in system memory. As has been 

During operation, instruction fetch pipeline 210 previously discussed since operands may reside at any byte 

prefetches instructions for execution in processor core 9. boundary, the hardware stack pointer may point to any byte 

Instruction address generation circuit 202 forms a prefetch 45 boundary at the time the exception is detected. In the present 

address to be used in accessing a next instruction. The embodiment of the invention, in which no operand misalign- 

pref etch address is provided to register 208 via multiplexer ment unit is implemented if the stack pointer is misaligned 

222. The prefetch address is provided to K-Bus 25 as the at the beginning of exception processing, the attempted 

KADDR signal. CPU 2 initiates the required K-Bus 25 creation of the exception stack frame will produce another 

transfer to fetch the instruction located at the address defined 50 exception and cause a catastrophic double fault condition 

by KADDR. The desired instruction may be located in which will halt the system's operation, 

memory 6. slave module 12, slave module 13 or a device To ensure mat system operation will not be halted due to 

connected to the E-Bus. if present. Regardless of the loca- a misaligned stack pointer at the time an exception is 

tion of the desired instruction, the data is eventually routed detected the present invention has implemented a circuit 

back to CPU 2 via the KRDATA (K-Bus Read Data) signal. 55 and method for correcting such situations. When an excep- 

The KRDATA signal provides the instruction to buffer 216. tion occurs in the present invention, CPU 2 of data processor 

Buffer 216 subsequently transfers the fetched instruction to 3 first examines a current value of a stack pointer register 

instruction fetch circuit 204. Once the fetched instruction is 244. CPU 2 then automatically aligns it to the nearest 

provided to instruction fetch circuit 204, instruction fetch 0-modulo-4 address below the current value. Once this 

circuit 204 provides the Instruction Buffer Write Data signal 60 adjustment is complete, CPU 2 stores an 8-byte stack frame 

to either FIFO instruction buffer 206 or multiplexer 219. which includes the type and location of the exception in 

FIFO instruction buffer 206 is bypassed and the Instruction system memory, which may be memory 6, slave module 12. 

Buffer Write Data signal is provided directly to multiplexer slave module 13, or another device connected to the E-Bus. 

219 when operand execution pipeline 220 is waiting for a The 8-byte stack frame includes a four bit format field which 

next instruction. Otherwise the Instruction Buffer Write Data 65 contains information about the alignment of the stack 

signal is provided to FIFO instruction buffer 206 and then to pointer at the time of the exception. This operation will now 

multiplexer 219. be described in greater detail. 
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When an ex ception is generated, pipeline control logic constant, -4. to multiplexer 252 of address generation/ 

218 asserts the Exception Processing signal and provides it execute circuit 214 via the RB signal. ALU 254 subse- 

to operand fetch circuit 212. Additionally, pipeline control quently sums the two values to provide an output have a 

logic 218 concurrently executes an exception sequence to valuc of the current stack pointer value (X-4) plus the -4 
save a current processor state into system memory, which 5 constant i.e. (X-8). The output of ALU 254 is provided to 

may be memory 6, slave module 12. slave module 13. or multiplexer 258 via the Execute Result signal. Multiplexer 

another device connected to the E-Bus. . During execution 258 provides the Operand Address signal to multiplexer 222 

of this exception sequence, pipeline control logic 218 and then to register 208. In this manner, the operand address 

enables operand fetch circuit 212 and address generation/ for me KBus 25 c y dc t0 P^ 0 ™ * c sec °n d thirty-two bit 
execute circuit 214 to generate write cycles on K-Bus 25. 10 t X< ?*? n Stack fmD£ slore * 8 eneratcd - Additionally, the 

Hie write cycles will write the information necessary to X 7 8 ^ ?° *^ ecu * Re f w ult Written 

form the stack frame discussed above into memory 6. Note. in ^ s tack pointer register 244 to reflect the top of the system 

^ . , . ^ . j . . J, J ,\ ^ stack at the conclusion of the exception stack frame writes. 

tte term stack frame may be used interchangeably with the Whfle ^ requil J^ ^ ^ 

terra ex^on stack frame generated the data to be stored in the exception stack rrame 

When generating the stack frame, pipeline control logic 15 must also be accessed. As the X-4 address value is being 

218 first provides control information which enables stack formed, the PC value is provided to other execute engines 

pointer register 244 of register file 228 (illustrated in FIG. 4) 256. The PC value is a program counter value of the faulting 

to output an "adjusted* stack pointer value to multiplexer instruction. The value of the PC defines the instruction 

246. This adjustment process involves truncating the current address of the instruction currently in operand execution 

value of the stack pointer register 244 to the nearest 20 pipeline 220 and is accessed from operand fetch circuit 212. 

0-modulo-4 address by forcing the low-order two bits to When the program value is generated and provided to 

zero. This truncation operation is performed by AND gates other execute engines 256 via the PC signal, the PC value is 

272 and 274. D uring normal processing, the provided to multiplexer 260 together with the status register 

Exception Processing signal is negated and, therefore, AND (SR) value, the Format (F) value, and the Vector (V) value, 

gates 272 and 274 simply pass the current value of the 25 Tnc ^ value is generated in response to state information 

low-order two bits. SP(1:0). of stack pointer register 244. generated by ALU 254 of address generation/execute logic 

Once an e xception is detected an d pipeline control logic 218 2U ' ^ state information produced by ALU 254 is output 

asserts the Exception Processing signal, the outputs of AND t0 lo S* c circuit 253 ^ the Execute Result signal. Logic 

gates 272 and 274 are forced to zero, thereby creating a circuit 253 Presses the state information to provide a 

O-raodulo-4 address. Additionally, stack pointer registerlw 30 st **? s y ^ " is ^ in J sta ? JS 257 • Status 

outputs the low-order two bits. SP( l:0)Tto othef execute rcgister 257 subsequently provides the status value to other 

engines 256. The adjusted stack pointer is provided to "^Sf"* f* Vl * * e S f *L r 

multiplexer 246. which passes this value on the A output to ™ C V ^ ^JLK^ * P ^ l0 ? 1C 

multiplexer 2 230 and then the value is loaded into register 21 * toe JW* CUITent excc P t1011 - ™» v ^ 

A 234 33 w "* subsequently used to access a vector defining the 

While me adjusted stacks instruction addressof the exception handling software rou- 

from register file 228 to register A 234, multiplexer 226 5l 0 . , - c u . £ u L , L , £ 

generates a constant of "-4 - The constant. -4. is then Fon f ? Vah,e , 15 ? four blt ficld wtuch dcfines 

transferred to multiplexer 232 and subsequently stored in ^ g J^ nt ° fth c e stack P°J nt « * *c time the exception was 
register B 236. At this point, the adjustedltlck pointer value 40 ^serted The Format value is generated by logic 259 in 

is stored in register A 234 and the constant, -4 is stored in * blts one . and 2ero ° e f *f ^ckpointer value at the 

register B 236. The adjusted stack pointer value corresponds ?"? f * c ™\ followin S table 

to the "X» illustrated in the various examples mustrated in m *" teS , mc rckt | onsm £ bctween mc stack values 
pjQ 7 ^ and the format values effectuated by logic 259. 

Next, register A 234 provides the adjusted stack pointer 45 

value to multiplexer 250 of address generation/execute TABLE 1 

circuit 214 via the RA signal. Similarly, register B 236 
provides the constant. -4, to multiplexer 252 of address 
generation/execute circuit 214 via the RB signal. ALU 254 
subsequently sums the two values to provide an output so 
having a value of the adjusted stack pointer value (X) plus 
the -4 constant i.e.. (X-4). The output of ALU 254 is 
provided to multiplexer 258 via the Execute Result signal, 
from multiplexer 258. the X-4 value is provided to multi- 
plexer 222 as the Operand Address signal. Multiplexer 222 55 Therefore, with reference to Table 1. when SP(1K)) has a 
provides the Operand Address signal to register 208. In this binary value of %00. logic 259 provides a format field 
manner, the operand address for the K-Bus 25 cycle to having a corresponding binary value of 9E>0100. Similarly, 
perform the first thirty-two bit exception stack frame store is when SP(1:0) has a binary value of %0l, logic 259 provides 
generated. a format field having a binary value of %0101. When 

The X-4 address value is also provided on the Execute 60 SP(1:0) has a binary value of %10, logic 259 provides a 
Result signal to multiplexer 230 and then to register A 234. format field having a corresponding binary value of %0110. 
At the start of a next timing cycle, register A 234 stores the Additionally, when SP( 1 :0) has a binary value of %1 1 . logic 
X-4 value and register B 236 stores the constant, -4. Again, 259 provides a format field having a binary value of %011 1. 
pipeline control logic 218 provides control information This format field of the exception stack frame will later be 
which enables register A 234 to provides the X-4 value to 65 used by the processor to restore stack pointer register 244 
multiplexer 250 of address generation/execute circuit 214 back to its original value once the exception has been 
via the RA signal. Similarly, register B 236 provides the processed by the handler routing. 



Original SP(1:0) at the 
Time of the Exception 


SP at die First 
Instruction of the 
Exception Handler 


Format 
Field 


00 


Original SP - 8 


0100 


01 


Original SP • 9 
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10 


Original SP - 10 


0110 
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Original SP - 11 


0111 
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When the exception is being processed, pipeline control exception handling program. From register 208. the excep- 

logic 218 provides control information (not illustrated tion fetch address is provided to K-Bus 25 via the KADDR 

herein) to enable multiplexer 260 to provide one of the SR. signal. 

PC, Format (F). and Vector (V) values to register 262. First, As the initial instruction fetch for the software program 
pipeline control logic 218 provides control information 5 for handling the exception is made and data retrieved, the 
which enables multiplexer 260 to select and transfer the PC instruction data is provided back to CPU 2 via the KRDATA 
value to register 262. From register 262, the PC value is signal. Specifically, the KRDATA signal is provided by 
provided to buffer 216 via the Operand Write Data signal. buffer 216 to instruction fetch circuit 204. Fetched instruc- 
Buffer 216 then drives the PC value to K-Bus 25 via the tions are then gated through multiplexer 219 (optionally 
KWDXTA(K-Bus Write Data) signal. Subsequently, the PC 10 using FIFO instruction buffer 206) to load the fetched 
value is stored in the memory location defined by the X-4 instructions into operand execution pipeline 220. The pro- 
value concurrently generated in the first timing cycle. cess described in detail above initiates and executes the 

During the second timing cycle, in which the X-8 address software program for handling the exception, 

is generated, pipeline control logic 218 provides control When the software program for handling the exception is 

information which enables multiplexer 260 to select and is completed data processor 3 will cease exception processing 

transfer the SR. Format (F). and Vector (V) values to register and return to normal processing. In order to return to normal 

262. From register 262, the SR. Format and Vector values processing, the context in which data processor 3 was 

are concatenated in an order illustrated in FIG. 7 and operating before the exception occurred must be restored 

provided to buffer 216 via the Operand Write Data signal. Therefore, the program counter, the status register, and the 

Buffer 216 then drives the concatenated value to K-Bus 25 20 stack pointer must be restored to the original values they had 

via the KWDATA (K-Bus Write Data) signal. Subsequently. before the exception occurred. The unique use of the excep- 

the Format/Vector/Status Register value is stored into a tion stack frame for storing the format field which defines 

system memory location defined by the X-8 value concur- the alignment of the stack pointer at the time of the excep- 

rently generated in the second timing cycle. tion is essential for correct and efficient restoration of data 

After the exception stack frame is generated and the PC. 25 processor 3 to the previous state. 

SR. Format and Vector values are stored in system memory. The last instruction in the software program for handling 

the exception is then processed by data processor 3. At this an exception must be an RTE (Return from Exception) 

point, the Vector value provided by pipeline control logic instruction. When the KTE instruction is executed, the two 

218 is provided to multiplexer 226 in operand fetch circuit longword values stored on the exception stack frame (as 

212. From multiplexer 226. the Vector value is provided to 30 illustrated in FIG. 7) will be retrieved. To retrieve these 

multiplexer 230 and then stored in register A 234. From values, a current stack pointer value is retrieved from stack 

register A 234, the Vector value is provided to multiplexer pointer register 244 of register file 228. Remember, at the 

250 via the RA signal. Concurrent with the Vector beginning of the software execution program, the value of 

generation, a Vector Base Register (VBR) is read in other the stack pointer was X-8. Although the stack area may be 

execute engines 256 and sent to multiplexer 232 via the 35 used for temporary storage during execution of the excep- 

Execute Result signal. The VBR provides the ability to tion handling program, the stack pointer must again contain 

"relocate" the exception vector table within the system a value of X-8 as the PTE instruction begins execution. The 

memory. The VBR value is gated through multiplexer 232 stack pointer value is provided to multiplexer 246 to be 

and loaded into register B 236. The Vector value is then communicated via the A signal. The A signal is provided to 

summed with the VBR value in ALU 254 to generate a 40 multiplexer 230 and men transferred to register A 234. At the 

vector address which defines the location in the system same time, multiplexer 226 generates a constant value of +4. 

memory containing the instruction address which corre- The constant value. 4. is transferred from multiplexer 226 to 

sponds to the generated exception. The sum provided by multiplexer 232 and stored in register B 236. 

ALU 254 is provided to multiplexer 258. When appropriate In a next timing cycle, the contents of register A 234. the 

control is provided to multiplexer 258 by pipeline control 45 X-8 value, are provided to multiplexer 258 by the RA 

logic 218. multiplexer 258 provides the vector address as the signal. Multiplexer 258 then provides the X-8 value to 

Operand Address signal. The Operand Address signal is multiplexer 222 via the Operand Address signal, from 

subsequently provided to multiplexer 222. From multiplexer multiplexer 222. the X-8 value is stored in register 208. The 

222. the vector number is output as the KADDR signal by X-8 value is subsequently provided to K-Bus 25 via the 

buffer 208. The KADDR signal is used by K-Bus 25 to read 50 KADDR signal. Data is then accessed at the address location 

the desired exception vector. in system memory specified by the X-8 value. 

The retrieved vector is transferred to buffer 216 via the Concurrent with the formation of X-8 as the operand 

KRDATA signal. From buffer 216, the retrieved vector is address, register A 234 containing the X-8 value and register 

provided to operand fetch circuit 212 by the KRDATA B 236 containing the -+4 value are respectively transferred to 

Buffer signal. The KRDATA Buffer signal is connected to 55 multiplexer 250 and multiplexer 252. Under control pro- 

the Execute Result signal to transfer the retrieved vector. vided by pipeline control logic 218. multiplexer 250 and 

The retrieved vector is subsequently provided to multiplexer multiplexer 252 transfer the X-8 value and the constant 4 to 

230 and stored in register A 234. From register A 234, the ALU 254. ALU 254 adds the two values to generate a result 

retrieved vector is provided to multiplexer 250 via the RA of X-4. The result is provided to the Execute Result signal 

signal Multiplexer 250 transfers the retrieved vector to ALU 60 which then transfers the X-4 value to multiplexer 230. 

254 where it is summed with zeroes produced by multi- Multiplexer 230 provides the X-4 value to register A 234. 

plexer 252 under control of pipeline control logic 218. ALU Referring to FIG. 7. when the address location specified 

254 essentially passes the retrieved vector onto the Execute by the X-8 value is accessed, the Format (F). Vector (V), 

Result signal where it provides an input to multiplexer 258 and SR values are accessed. The Format, Vector, and SR 

and then multiplexer 222. The retrieved vector is then loaded 65 values are provided to buffer 216 via the KRDATA signal 

into KADDR register 208. This address represents the From buffer 216, the Format Vector, and SR values are 

instruction fetch address for the starting instruction for the provided to multiplexer 232 of operand fetch circuit 212 via 
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the Buffer KRDATA signal and the Execute Result signal. Consider the following example taken from a well-known 
These values are then gated in register B 236. From register benchmark: 
B 236. the Format. Vector and SR values are provided to 
multiplexer 252 via the RB signal. The entire contents of 
register B 236 are subsequently passed through ALU 254 
without modification and processed by logic circuit 253 to 
provide a status value. The status value is then stored in 
status register 257 to indicate an operating context of data 
processor 3 when the exception occurred. Similarly, from 
register B 236. the Format value is provided to other execute 
engines 256 via the RB signal The Format value is subse- 10 ^ stents of the stack pointer before executing the 
quently stored in register 261. instruction at address $00005064 is $00006ed0. The mov.b 

In a subsequent timing cycle, the X-4 value stored in opcode at address $00005064 copies one byte of data (let the 
register A 234 is provided to multiplexer 258 by the RA $aa) from a source Location defined by the contents 

signal. The RA signal transfers the X-4 value to multiplexer of register aO onto the system stack and decrements the stack 
222 and then to register 208. The X-4 value is subsequently 15 Pointer by 1. At the conclusion of this instruction, the stack 
provided to K-Bus 25 via the KADDR signal. Data is then pointer value is $00006ecf. 

accessed at the address location in system memory specified Suppose an interrupt request then arrived, causing the 
by the X-4 value. execution of the instruction at address $00005056 to be 

Referring again to FIG. 7, when the address location postponed to service the interrupt request. As the interrupt 
specified by the X-4 value is accessed, the program counter 20 exception is processed, the following exception stack frame 
(PC) value is accessed. The PC value is provided to buffer is created: 
216 via the KRDATA signal From buffer 216. the PC value 
is provided to operand fetch circuit 212 via the Buffer 
KRDATA signal. The PC value is gated onto the Execute 



Address Data 



Result signal and routed to multiplexer 232, where it is 25 $oooo6eo4 $70780008 #fannai = $7, vector = $078. status = $ooce 

passed and loaded into register B 236. At this point in time. $oooofec8 $00005066 tinstrctioa address 

register A 234 contains the X-4 value and register B 236 $00QQ6ccc * aa #to copied by mov.t opcode 

contains the new program counter address. ™~" ~ " "" ™ — ~ ~ ~ ~ — — ^— — 

In the next timing cycle, the register B 236 value, con- where (he stack pointer register value at the beginning of 
taining the new program counter address, is transferred to 30 the exception handling software program is $00006ec4. 
multiplexer 2S8. This address is then transferred to mum- After the interrupt request is serviced, the exception 
plexer 222 via the Operand Address signal where it is loaded handler executes an RTE instruction. The 32 bits of data at 
into register 208. This address defines the instruction address the top of the stack is fetched and the Format, Vector and SR 
to resume execution after processing the exception. The fields processed by the CPU 2 as previously described. The 
KADDR register 208 is then used to access the desired 35 interim X-4 address of $00006ec8 is formed and tempo- 
instruction from system memory to resume the execution of rariiy saved in register A 234. The instruction address of 
the program. The desired instruction is returned on $00005066 is fetched from location $00006ec8 and used to 
KRDATA. passed through buffer 216 into instruction fetch establish a new instruction stream. Concurrently, the saved 
circuit 204. format field ($7) is summed in ALU 254 with the contents 

In a subsequent timing cycle, the Format value is retrieved 40 of register 234 ($000006ec8) to form the restored stack 

from register 261 via the Execute Result signal TTus value pointer ($00006ec8+$00000007=$00006ecf). which is 

is men sent to multiplexer 232 and loaded into register B loaded into the stack pointer register 244 of the register file 

236. At this time, register A 234 still contains the X-4 value. 228. Control then resumes with the execution of the instruc- 

and register B contains the Format field In the next timing tion at address $00005066. 

cycle, the contents of both multiplexer 250 and multiplexer 45 In summary, the present invention provides a data pro- 

252 are provided to ALU 254 which sums the two values, cesser and method of operating a data processing system in 

i.e.. X-4+FormaL which a single system stack pointer may be used to create 

The summation recreates the original stack pointer value records of both supervisor and user stack operations when 

at the time of the exception. The ALU output is gated onto hardware support for alignment of such stack operands is 

the Execute Result signal and stored in stack pointer register so optional. The single system stack pointer is implemented as 

244 of register file 228 as the current stack pointer. a self-aligning stack pointer which automatically aligns 

At this point the data processor 3 has completely restored itself to a nearest 0-moduio-4 address below the current 

the state of the machine back to its value at the time of the address setting such that no system errors occur even when 

exception: the instruction fetch pipeline 210 is fetching there is no hardware support for misaligned operands. Once 

instructions at the restored program counter value, and the 55 the automatic alignment has occurred, the data processor of 

stack pointer register 244 has been restored to the value at the present invention stores a four bit format field in an 

the time of the exception. The seff-ah'gning characteristics of exception stack frame to indicate information about the 

the current invention enable the stack pointer to automati- alignment of the stack pointer at the time of the fault When 

cally align itself to the nearest 0-modulo-4 address below the the exception has been serviced and a return from exception 

current stack pointer setting when an exception is received. 60 (KTE) instruction is executed, the processor uses the four bit 

When the exception has been processed and data processor format field stored in the exception stack frame to restore the 

3 is beginning to operate in a normal mode of operation, the stack pointer to its original value at the time of the exception, 

stack pointer must be returned to its original value, even if The implementation of the invention described herein is 

that value was misaligned at the time the exception was provided by way of example only and many other imple- 

received. The addition of the format field to the interim X-4 65 mentations may exist for executing the function described 

value of the stack pointer provides a unique and efficient herein. For example, the constant generated and used to 

method for returning the stack pointer to its initial value. create addresses for the exception stack frame is not con- 



10/21/2003, EAST version: 1.4.1 



5.761.491 



IS 



16 



strained to be -4. but may be any number desired by a 
designer or user of data processing system 5. 

While the present invention has been illustrated and 
described with reference to specific embodiments, further 
modifications and improvements will occur to those skilled s 
in the art It is to be understood, therefore, that this invention 
is not limited to the particular forms illustrated and that the 
appended claims cover all modifications that do not depart 
from the scope of this invention. 

We claim: 10 

1. A method for operating a data processing system, 
comprising the steps of: 

receiving a first signal within the data processing system 
indicating a first predetermined condition has occurred; 

retrieving a current address value from a stack pointer 13 
register; 

aligning the current address value to a nearest 0-modulo-4 
address next to the current address value; and 

storing a format field value in a memory, the format field 20 
indicating a manner in which the current address value 
was aligned at the time of receiving the first signal 
indicating the first predetermined condition has 
occurred. 

2. The method of claim 1 wherein the first predetermined 2 5 
condition is a exception processing condition. 

3. The method of claim 1 wherein a step of aligning, 
further comprises the steps of: 

negating a predetermined portion of the current address 
value retrieved from the stack pointer register. 30 

4. The method of claim 1 further comprising the steps of: 
receiving a second signal within the data processing 

system indicating a second predetermined condition 
has occurred; 

retrieving the format field value from the memory; and 35 
restoring the current address value to the stack pointer 
register. 

5. A data processor, comprising: 

bus means for providing a plurality of address values and 40 

a plurality of data values to the data processor; 
control means for determining when an exception in 

operation has occurred and providing an exception 

processing signal in response thereto; 
a register for indicating a current address value accessed 45 

when the exception in operation has occurred; 
a logic circuit coupled to the register for receiving a first 

portion of the current address value and coupled to the 



control means for receiving the exception processing 
signal, the logic circuit selectively modifying the cur- 
rent address value to form an adjusted address value; 
and 

a format generation circuit coupled to the register for 
receiving at least the first portion of the current address 
value, the format generation circuit modifying the first 
portion of the current address value to generate a stack 
alignment value to indicate an alignment of the current 
address value when the exception in operation has 
occurred. 

6. The data processor of claim 5 further comprising a 
memory coupled to the format generation circuit for storing 
the stack alignment value. 

7. The data processor of claim 5 wherein the current 
address value is stored in the register when the exception in 
operation has occurred and been completed. 

8. The data processor of claim 5 the adjusted address 
value is a nearest 0-modulo-4 address less than the current 
address value. 

9. A data processor, comprising: 

control means for generating a first data processor inter- 
rupt signal; 

a stack pointer register for storing and providing a current 
address value having a first portion and a second 
portion; 

a logic circuit coupled to the control means for receiving 

the first interrupt signal and coupled to the stack pointer 

for receiving a first portion of the current address value. 

the logic circuit creating an adjusted address value; and 
a converter circuit coupled to the stack pointer register for 

receiving a second portion of the current address value. 

the converter circuit generating a format field which. 

when summed with the adjusted address value will 

generate the current address value. 

10. The data processor of claim 9 wherein the format field 
is stored in a memory of the data processor. 

11. The data processor of claim 10 wherein the format 
field is retrieved from the memory of the data processor and 
used to restore the current address value to the stack pointer 
register. 

12. The data processor of claim 9 wherein the adjusted 
address value is a nearest 0-modulo-4 address value less 
than the current address value. 
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